ReproHack Hub

Browse ReproHack papers

Planning Support Systems for Long-Term Climate Resilience: A Critical Review

Authors: Supriya Krishnan, Nazli Yonca Aydin & Tina Comes

DOI: https://doi.org/10.1007/978-3-030-76059-5_24

Submitted by Supriya.kr09
Number of reviews: 1
Why should we attempt to reproduce this paper?
This article used an open-source Python repository for its analysis. It is well suited for reproduction as the literature on the intersection of urban planning and climate change evolves. The adapted code is published alongside the article.

Tags: Meta-analysis, machine learning, urbanism, literature review, Urban Knowledge Systems, Topic modelling, Planning Support Systems

Living HTA: Automating Health Technology Assessment with R

Authors: Robert A. Smith, Paul P. Schneider, Wael Mohammed

DOI: 10.12688/wellcomeopenres.17933.1

Submitted by rasmith3

Why should we attempt to reproduce this paper?
We think this is an interesting paper for anyone who wants to learn to build an API with the R package plumber. This is a novel method in health economics, and we believe it will help improve the transparency of modelling methods in our field.

Tags: R, Shiny, Health Economics, HTA, plumber

Droplet impact onto a spring-supported plate: analysis and simulations

Authors: Michael J. Negus, Matthew R. Moore, James M. Oliver, Radu Cimpeanu

DOI: https://doi.org/10.1007/s10665-021-10107-5

Submitted by MNegus
Mean reproducibility score: 8.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
The direct numerical simulations (DNS) for this paper were conducted using Basilisk (http://basilisk.fr/). As Basilisk is free software written in C, it can be readily installed on any Linux machine, and it should then be straightforward to run the driver code to reproduce the DNS from this paper. However, the numerical solutions presented in this paper are the result of many high-fidelity simulations, each of which took approximately 24 CPU hours running on 4 to 8 cores. The main difficulty in reproducing the results is therefore the amount of computation involved, so HPC resources will be required. The DNS in this paper were used to validate the presented analytical solutions and to extend the results to a longer timescale. Reproducing these numerical results will build confidence in them by checking that they are independent of the system architecture they were produced on.
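As a rough guide to the compute budget described above, the figures can be turned into a back-of-the-envelope estimate. The sketch below assumes that "24 CPU hours" means total core-hours per simulation and that runs scale ideally across cores; neither assumption is confirmed by the authors. The number of runs (`n_runs`) is a hypothetical placeholder, since the text only says "many".

```python
# Back-of-the-envelope compute estimate for the DNS runs.
# Assumptions (not from the paper): "CPU hours" are total core-hours
# per run, and parallel scaling is ideal.

CPU_HOURS_PER_RUN = 24.0  # figure quoted by the authors


def wall_clock_hours(cpu_hours: float, cores: int) -> float:
    """Wall-clock hours for one run under ideal parallel scaling."""
    if cores <= 0:
        raise ValueError("cores must be a positive integer")
    return cpu_hours / cores


def total_cpu_hours(cpu_hours_per_run: float, n_runs: int) -> float:
    """Total core-hours needed for n_runs independent simulations."""
    if n_runs < 0:
        raise ValueError("n_runs must be non-negative")
    return cpu_hours_per_run * n_runs


if __name__ == "__main__":
    # Core counts quoted by the authors.
    for cores in (4, 8):
        hours = wall_clock_hours(CPU_HOURS_PER_RUN, cores)
        print(f"{cores} cores: about {hours:.1f} h wall-clock per run")

    n_runs = 50  # hypothetical; the actual number of runs is not stated
    budget = total_cpu_hours(CPU_HOURS_PER_RUN, n_runs)
    print(f"{n_runs} runs: about {budget:.0f} core-hours in total")
```

Real runs rarely scale ideally, so these wall-clock figures are best read as lower bounds.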

Tags: HPC, C, CFD, Fluid Dynamics, DNS, Mathematics, Droplets, Basilisk

Machine learning a model for RNA structure prediction

Authors: Nicola Calonaci, Alisha Jones, Francesca Cuturello, Michael Sattler, Giovanni Bussi

DOI: 10.1093/nargab/lqaa090

Submitted by giovannibussi

Why should we attempt to reproduce this paper?
The method is trained on the data that were available, but it is meant to be re-trainable as soon as new data are published. It would be great to confirm that someone else is able to do it. If we receive any feedback, we would be really happy to improve our GitHub repository to make the reproduction easier!

Tags: Python, machine learning, RNA, bioinformatics

Accelerating the prediction of large carbon clusters via structure search: Evaluation of machine-learning and classical potentials

Authors: Bora Karasulu, Jean-Marc Leyssale, Patrick Rowe, Cedric Weber, Carla de Tomas

DOI: 10.1016/j.carbon.2022.01.031

Submitted by bkarasulu
Number of reviews: 1
Why should we attempt to reproduce this paper?
This paper presents a fine example of a high-throughput computational materials screening study, focusing mainly on carbon nanoclusters of different sizes. In the paper, a diverse set of empirical and machine-learned interatomic potentials, which are commonly used to simulate carbonaceous materials, is benchmarked against higher-level density functional theory (DFT) data, using a range of structural features as the comparison criteria. Trying to reproduce the data presented here (even if you only consider a subset of the interaction potentials) will help you understand how you could approach a high-throughput structure prediction problem. Even though we concentrate here on isolated/finite nanoclusters, AIRSS (and similar approaches such as USPEX, CALYPSO and GMIN) can also be used to predict crystal structures of different classes of materials with applications in energy storage, catalysis, hydrogen storage, and so on.

Tags: Python, HPC, LAMMPS, DFT, interatomic potentials, Python scripting, AIRSS, structure prediction, density functional theory, high-throughput, machine-learning

Automatic learning of hydrogen-bond fixes in an AMBER RNA force field

Authors: Thorben Fröhlking, Vojtěch Mlýnský, Michal Janeček, Petra Kührová, Miroslav Krepl, Pavel Banáš, Jiří Šponer, Giovanni Bussi

Submitted by giovannibussi

Why should we attempt to reproduce this paper?
We do care about reproducibility. If we receive any feedback, we would be really happy to improve our GitHub repository and/or submitted manuscript to make the reproduction easier!

Tags: Python, HPC, machine learning, Molecular Dynamics

Synergistic coupling in ab initio-machine learning simulations of dislocations

Authors: Petr Grigorev, Alexandra M. Goryaeva, Mihai-Cosmin Marinica, James R. Kermode, Thomas D. Swinburne

DOI: https://arxiv.org/abs/2111.11262

Submitted by jameskermode

Why should we attempt to reproduce this paper?
Systematically improvable machine learning potentials could have a significant impact on the range of properties that can be modelled, but the toolchain associated with using them presents a barrier to entry for new users. Attempting to reproduce some of our results will help us improve the accessibility of the approach.

Tags: HPC, interatomic potentials, machine learning

Sensitivity and dimensionality of atomic environment representations used for machine learning interatomic potentials

Authors: Berk Onat, Christoph Ortner and James Kermode

DOI: 10.1063/5.0016005

Submitted by jameskermode

Why should we attempt to reproduce this paper?
Popular descriptors for machine learning potentials, such as the Behler-Parrinello atom-centred symmetry functions (ACSF) or the Smooth Overlap of Atomic Positions (SOAP), are widely used, but so far not much attention has been paid to optimising how many descriptor components need to be included to give good results.

Tags: HPC, descriptors, interatomic potentials, machine learning

Optimizing the Use of Carbonate Standards to Minimize Uncertainties in Clumped Isotope Data

Authors: Ilja J. Kocken, Inigo A. Müller, Martin Ziegler

DOI: 10.1029/2019GC008545

Submitted by japhir

Why should we attempt to reproduce this paper?
Even though the approach in the paper focuses on a specific measurement (clumped isotopes) and on how to optimize which and how many standards we use, I hope that the problem is general enough that the insights translate to any kind of measurement that relies on machine calibration. I've committed to writing a literate program (plain text interspersed with code chunks) to explain what is going on and to build the simulations one step at a time. I really hope that this is understandable to future collaborators and scientists in my field, but I have not had any internal code review, and I did not receive any feedback on the code from the reviewers. I would love to see whether what in my mind represents "reproducible code" is actually reproducible, and to learn what I can improve for future projects!

Tags: R, tidyverse, emacs, literate, earth sciences, clumped isotopes, org-mode, geology

Where should new parkrun events be located? Modelling the potential impact of 200 new events on socio-economic inequalities in access and participation

Authors: Schneider PP, Smith RA, Bullas AM, Bayley T, Haake SS, Brennan A, Goyder E

Submitted by hub-admin
Mean reproducibility score: 7.0/10 | Number of reviews: 3
Why should we attempt to reproduce this paper?
If all went right, the analysis should be fully reproducible without the need to make any adjustments. The paper aims to find optimal locations for new parkruns, but we were not 100% sure how 'optimal' should be defined. We provide a few examples, but the code was meant to be flexible enough to allow potential decision makers to specify their own, alternative objectives. The spatial data set is also quite interesting and fun to play around with. Caveat: the full analysis takes a while to run (30+ min) and might require at least 8 GB of RAM.
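The idea of letting decision makers supply their own definition of 'optimal' can be illustrated with a pluggable objective function. The following is a minimal Python sketch, not the authors' code (their analysis is written in R). The `Candidate` fields, both objective functions, and the toy data are hypothetical, and the sketch scores each site independently, which a real location analysis would probably not do.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Candidate:
    """A hypothetical candidate site for a new event."""
    name: str
    people_reached: int   # people who would gain access (toy value)
    deprivation: float    # 0 = least deprived, 1 = most deprived (toy value)


# An objective maps a candidate site to a score; higher is better.
Objective = Callable[[Candidate], float]


def maximise_reach(site: Candidate) -> float:
    """Objective 1: favour sites that reach the most people."""
    return float(site.people_reached)


def prioritise_deprived(site: Candidate) -> float:
    """Objective 2: weight the people reached by local deprivation."""
    return site.people_reached * site.deprivation


def pick_sites(sites: List[Candidate], objective: Objective, k: int) -> List[Candidate]:
    """Return the k best sites under the supplied objective."""
    return sorted(sites, key=objective, reverse=True)[:k]


# Toy data, invented purely for illustration.
SITES = [
    Candidate("A", people_reached=1000, deprivation=0.1),
    Candidate("B", people_reached=600, deprivation=0.9),
    Candidate("C", people_reached=800, deprivation=0.5),
]

if __name__ == "__main__":
    for objective in (maximise_reach, prioritise_deprived):
        chosen = [site.name for site in pick_sites(SITES, objective, k=2)]
        print(f"{objective.__name__}: {chosen}")
```

Swapping the objective changes which sites are selected without touching the selection routine, which is the kind of flexibility the authors describe.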

Tags: R, GDAL, GEOS, GIS, Shiny, PROJ

Open Trade Statistics

Authors: Pachá (Mauricio Vargas Sepúlveda)

Submitted by hub-admin

Why should we attempt to reproduce this paper?
The focus of the project is reproducibility. Here we show how data access differs from similar initiatives: https://ropensci.org/blog/2019/05/09/tradestatistics/. Also, similar projects have obscure parts, while ours exposes the code from raw data download to dashboard creation.

Tags: R, Shiny
